Translation Events in Dutch Cross-Language Information Retrieval
نویسنده
چکیده
The paper describes an analysis of the translation events encountered when queries cross the language barrier in crosslanguage information retrieval. A study of a set of query source and target triples resulted in the creation of a translation taxonomy. The taxonomy was used to code 750 English target queries. The 750 coded queries are currently being used in retrieval experiments to assess the impact of the different translation problems on retrieval performance.
منابع مشابه
Query Translation for Cross-lingual Information Retrieval using Wikipedia
In this paper the system WikiTranslate is introduced that performs query translation for cross-lingual information retrieval (CLIR) that only uses Wikipedia. Queries will be mapped to Wikipedia concepts and the corresponding translations of these concepts in the target language are used to create the final query. WikiTranslate is evaluated by searching with topics in Dutch, French and Spanish i...
متن کاملTranslation Events in Cross-language Information Retrieval: Lexical Ambiguity, Lexical Holes, Vocabulary Mismatch, and Correct Translations
Cross-Language Information Retrieval (CLIR) systems enable users to formulate queries in their native language to retrieve documents in foreign languages. Because queries and documents in CLIR do not necessarily share the same language, translation is needed before matching can take place. This translation step tends to cause a reduction in the retrieval performance of CLIR as compared to monol...
متن کاملCross-Language Information Retrieval using Dutch Query Translation
This paper describes an elementary bilingual information retrieval experiment. The experiment takes Dutch topics to retrieve relevant English documents using Microsoft SQL Server version 7.0. In order to cross the language barrier between query and document, the researchers use query translation by means of a machine-readable dictionary. The Dutch run was void of the typical natural language pr...
متن کاملCross-language information retrieval in Twenty-One: Using one, some or all possible translations?
This paper gives an overview of the tools and methods for Cross-Language Information Retrieval (CLIR) that were developed within the Twenty-One project. The tools and methods are evaluated with the TREC CLIR task document collection using Dutch queries on the English document base. The main issue addressed here is an evaluation of two approaches to disambiguation. The underlying question is whe...
متن کاملMultilingual Domain Modeling in Twenty-One Automatic Creation of a Bi-directional Translation Lexicon from a Parallel Corpus
Within the project Twenty-One, which aims at effective dissemination of information on ecology and sustainable development, a system is developed that supports cross-language information retrieval for any of the four languages Dutch, English, French and German. Knowledge of this application domain is neededto enhanceexisting translation resources for the purpose of lexical disambiguation. This ...
متن کامل